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Real Party in Interest 

Amersham Biosciences AB, formerly known as Amersham Pharmacia Biotech 
AB, is the assignee and owner of the captioned application, is the real party in interest to 
this appeal. 

Related Appeals and Interferences 

There are no other appeals or interferences related to the instant appeal. 

Status of Claims 

Claims 1-12 and 15-20 are pending in the captioned application. These are the 
only pending claims subject to examination before the U.S. Patent and Trademark Office. 
A copy of these claims is appended hereto. 

Status of Amendments 

There are no outstanding amendments with regard to the captioned application. 
However, Appellant has submitted proof of the abandonment of copending patent 
application number 10/478,910, and thus has obviated the Examiner's grounds for the 
obviousness-type double patenting rejection. This rejection is not addressed in the instant 
Appeal. 

Summary of Claimed Subject Matter 

The instant invention relates to methodology and reagents for use in identification 
of peptides or polypeptide using mass spectrometric technique, which reagents are more 
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environmentally acceptable than prior use of organic solvents. The reagents include a a 
novel class of water-stable derivatization reagents, which include a sulfonyl moiety 
coupled to an activated acid moiety, to provide peptide derivatives for analysis in 
subsequent mass spectrometry steps. The invention further provides a simplified and 
easily automated procedure for derivatizing peptides or polypeptides prior to such 
identification (page 3, lines 5-15). 

The methodology for identification of peptides or polypeptides comprises the 
steps of: (a) derivatization in an aqueous solution the N-terminus of the polypeptide, or 
the N-termini of one or more peptides of the polypeptide, with at least one acidic reagent 
comprising a sulfonyl moiety coupled to an activated acid moiety to provide one or more 
peptide derivatives, which reagent exhibits a half-life in aqueous solution of not less than 
1 0 minutes, preferably not less than about 20 minutes and most preferably not less than 
about 30 minutes at room temperature; (b) analyzing at least one such derivative using a 
mass spectrometric technique to provide a fragmentation pattern; and (c) interpreting the 
fragmentation pattern obtained (page 3, lines 15-26). 

Independent claim 1 recites this as the above-referenced method; claims 2-12 
which are directly or ultimately dependent on claim 1, further define properties of the 
acidic reagent (see claim 2 and page 10, line 19); claims 3 and 4 recite particular mass 
spectrometric techniques which can be utilized (see e.g., page 8, line 6); claims 5 and 6 
recite specific automation use in the process (see e.g., page 10, line 1-10); claims 7 to 
and 9 define suitable activated acid moieties (see e.g., page 10, line 25 to page 11, line 5) 
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and claims 10 to 12 define methodology for obtaining the peptides and peptides of 
interest (see e.g.. page 14, line 9 to page 15, line 20). Further claim 15 defines a kit 
useful for the methodology (see e.g., page 16, line 5 -10) and claims 16 to 20 recite the 
kit containing the limitations of claims 2 to 9 described supra . 

Grounds of Rejection to be Reviewed on Appeal 

1. Whether claims 1-6, 10-13 and 15-18 are properly rejected under 35 U.S.C. § 
102(a) as being anticipated by Keough, et al (WO 00/43792). 

2. Whether claims 7-9 and 19 are properly rejected under 35 U.S.C. § 103(a) as 
being unpatentable over Keough, et al in view of Little et al (USP 6,322,970). 

Argument 

1. Claims 1^ and 10-13 and 15-18 are not properly rejected under 35 U.S.C. § 
102(a) as being anticipated by Keough, et al (WO 00/43792). 

The Examiner has rejected claims 1-6, 10-13 and 15-18 under 35 U.S.C. § 
102(a) as "being anticipated by Keough et al (WO 00/43792)". 

Specifically, the Examiner stated, "Keough et al anticipates the instant invention 
by teaching a method of identifying a polypeptide by derivatizing the N-termini of one or 
more peptides with one or more acidic moieties having a pKas of less than about 2 when 
coupled with the polypeptide or peptide to provide one or more derivatized analytes. 
This method is analyzed by mass spectrometric techniques to provide a fragmentation 
pattem (summary)". 
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The Examiner continued, "the fragmentation pattern of the peptide is interpreted 
to sequence the polypeptide (page 7, lines 1-2). Thomas et al discloses coupling an 
acidic moiety reagent to the N-terminus of a cysteine-containing peptide, followed by 
oxidation to produce peptides containing two acidic moieties (sulfonic acids). The 
preferred acidic moieties are 2-sulfoacetyl, 2-sulfobenzoyl and 3-sulfopropionyl moieties 
(page 9, lines 1-15). These preferred acidic moieties are sulfonyls coupled to an ester 
moiety such as sulfosuccinic anhydride, and 2-sulfobenzoic acid cyclic anhydride and 
others (page 9, lines 1-30)". 

The Examiner conceded, "Keough et al is silent with respect to the half-life of the 
acid reagent not being less than 10 minutes, however, it is the Examiner's position that 
this teaching is inherent to what the instant reference teaches. Keough et al teaches the 
acid reagents utilized in the instant invention, therefore these reagents will inherently 
exhibit a half-life in aqueous solution of not less than 10 minutes". 

In response. Appellant respectfully reiterated the arguments that the recited acidic 
reagent contains "a sulfonyl moiety coupled to an ester moiety"; such is neither taught, 
nor even suggested, by the Keough, et al. reference. While Appellant conceded that 
Keough, et al. does teach coupling of sulfonyls to anhydride reagents, Appellant asserted 
that anhydrides are not esters. The benefits of use of the reagents of the instant invention 
are disclosed at pages 8-11 of the captioned application, and include aqueous stability. 
Indeed, as stated at page 9, lines 5-11, one of the advantages of the instant invention 
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compared to the disclosure of the Keough, et al. reference "resides in the fact that 
according to the present invention all steps can be carried out under aqueous conditions. 
As previously suggested technology required to dry-down steps and several small pH 
changes from basic to acidic, and vice versa, the present method is much more amenable 
to automation". 

In response to this argument, the Examiner stated, "although the anhydride 
reagents recited by Keough are not esters; a sulfonyl group coupled to an ester is found 
v^ithin the 3-sulfopropionic anhydride structures, (see example 8, page 14) and as 
evidenced by Appellant's instant specification on page 23 which describes the synthesis 
of a NHS ester from a 3-sulfopropianic anhydride. Although the reference of Keough et 
al did not synthesize an NHS ester from 3-sulfopropionic anhydride, it is apparent that an 
ester coupled to a sulfonyl group is within the structure of the 3-sulfopropionic 
anhydride. The instant claim 1 calls for at least one acidic reagent containing a sulfonyl 
moiety coupled to an ester moiety, therefore, Keough satisfies the limitation of claim 1". 

In response. Appellant pointed out that the compound disclosed at page 14, 
example 8 of the Keough, et al. reference is 3-sulfopropionic anhydride. Even if 
Appellant were to accept the Examiner's contention that this compound embodies "a 
reagent containing a sulfonyl moiety coupled to an ester moiety", which Appellant did 
not concede, anhydrides are very sensitive to water and would not exhibit the half-life 
recited in the claim "a half-life in aqueous solution of not less than 10 minutes at room 
temperature". Indeed, as stated at page 24, lines 23-25 of the captioned application, 3- 
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sulfopropionic anhydride "was very sensitive to water and it was necessary to dry all 
equipment in an oven before use and to do the reaction and purification under an argon 
atmosphere". 

Further, at page 3, lines 5-10 of the captioned application, it is stated that one 
objective of the instant invention is achieved by "using a novel class of water-stable 
derivatization reagents, which comprise a sulfonyl moiety coupled to an activated acid 
moiety". Thus, the water stability recited in the claim is an inherent characteristic of the 
reagents used in the methodology of the instant invention. Such water stability is neither 
disclosed nor even suggested by the Keough, et al. reference. 

As for the Examiner's statement regarding Appellant's assertion that all the steps 
can be carried out under aqueous conditions making it amenable to automation, versus 
the previous technology which requires dry-down steps and stated, "This assertion is 
noted but not found persuasive because these benefits are not found in the claims", 
Appellant respectfully directs the Examiner's attention to the statement in claim 1(a), 
which states that the reagent exhibits "a half-life in aqueous solution. . ." Such benefits 
are clearly a result of this stability in aqueous solution. 

In response to these arguments, the Examiner directed Appellant's attention to 
"the examples on page 12, example 6", and suggested that Appellant "compare it with 
applicant's example in the specification on page 23". The Examiner continued, "This 
reagent is not an anhydride; it has a sulfonyl group with an ester group attached, and 
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therefore it would appear that this reagent would have the same benefits as the aqueous 
stability as recited in the instant claim 1". 

In response, Appellant respectfully points out that the reagent in Example 6, i.e., 
the material (which is recited with the peptide) does not include "a sulfonyl moiety 
coupled to an ester moiety" as recited in instant claim 1 . Specifically, Appellant 
respectfully asserts that such is critical to the instant invention since the purpose of the 
sulfonyl moiety permits stabilization of an acidic moiety with the desired pKa value 
while having a desirable half-life in the solution. While the ultimate reacted product 
contains a peptide coupled to an organic chain including a sulfonyl moiety, it is obtained 
by quite different means. Indeed, as stated at page 13, after the reaction, the sample is 
"desalted and oxidized with 10 p.L of formic acid (88%):H202 (30%) prepared at a ratio 
of 19:1 (v:v). Oxidation proceeds for 30 minutes at room temperature, and the derivatized 
analyte is dried prior to mass spectral analysis". Appellant respectfully submits that this 
is quite different from the instant invention since such harsh oxidation treatment will 
likely cause side reactions which could change the peptide structure and therefore give 
false results in mass spectrometry. 

Appellant, thus, asserts that it is only with the teachings of the instant invention 
that a suitable methodology is obtained, and it is only with the use of the specific reagents 
recited within the methodology that such is attainable. Far from being a routine property, 
Appellants respectfully assert that the specific properties of the reagents in this invention 
are neither disclosed nor even suggested by the Keough, et al. reference. 
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In view of the foregoing, Appellant respectfully submits that the Examiner's 
rejection cannot be sustained and should be reversed. 

2. Claims 7-9 and 19 are not properly rejected under 35 U.S.C. § 103(a) as 

being unpatentable over Keough, et al (WO 00/43792) in view of Little, et al 
(USP 6,322,970). 

The Examiner has rejected claims 7-9 and 19 under 35 U.S.C. § 103(a) as "being 
unpatentable over Keough et al in view of Little et al (USP#6,322,970)". 

Specifically, the Examiner stated, "the teachings of Keough et al are set forth 
above and differ from the instant claims by not disclosing a sulfonyl group being coupled 
to a particular ester such as N-hyroxysuccinimide (NHS) ester". 

The Examiner continued, "Little et al discloses a process for determining the 
identity of a target polypeptide using mass spectroscopy (abstract). Little et al discloses 
that target polypeptides can be captured by conjugation to a solid support by 
immobilizing. The conjugation can be mediated through a linker such as a sulfo-N- 
hydroxysuccinimide (NHS) ester that facilitates conjugation of the polypeptide through it 
amino terminus. . ." The Examiner further stated, "claim 12 recites a step of protecting 
lysine residues prior to derivatizing. Little et al discloses that the termini of a target 
polypeptide are more reactive than the amino acid side groups and therefore the amino 
acid residues should be blocked prior to performing the reaction of interest (column 60, 
lines 27-50)". 
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The Examiner concluded, "it would have been obvious to one of ordinary skill in 
the art to modify the reference of Keough et al to couple an N-hydroxysuccinimide 
(NHS) ester to a sulfonyl group taught by Little et al to facilitate conjugation of peptides 
to a solid support which has the advantage of being manipulated so that reagents and 
undesirable reaction products can be washed from the remaining immobilized 
polypeptide, which can then be cleaved from the solid support and analyzed by mass 
spectrometry (column 60, lines 17-27)". 

In response. Appellant respectfully pointed out that the Little, et al. reference 
discloses the utilization of NHS esters to facilitate attachment to a solid support. As 
stated at column 4, lines 38-52, this immobilization provides "a means to isolate the 
polypeptide, as well as a means to manipulate the isolated target polypeptide prior to 
mass spectrometry". Such is quite different from the instant invention wherein the 
derivatization is done in aqueous solution and does not require attachment to a solid 
support. The benefits of such aqueous solution derivatization are discussed above, and 
are neither disclosed nor even suggested by either the Keough, et al. or Little, et al. 
references. 

In response to this argument, the Examiner has stated, "Applicant['s] argument 
that the references of Keough et al and Little et al does not disclose or suggest 
derivatization of the peptides in aqueous solution. This argument is noted but not found 
to be persuasive". 
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The Examiner continued, "In response to applicant's arguments against the 
references individually, one cannot show nonobviousness by attacking references 
individually where the rejections are based on combinations of references. See In re 
Keller, 642 F.2d 413, 208 USPQ 871 (CCPA 1981); In re Merck & Co,, 800 F.2d 1091, 
23 1 USPQ 375 (Fed. Cir. 1986). Keough et al teaches derivatization in an aqueous 
solution, which are called for by the instant claim 1 (see page 5, lines 11-12). The 
reference of Little et al was relied on for its teaching of a particular (NHS) ester group, 
which discloses that (NHS) esters facilitate conjugation of the polypeptide to a solid 
support during the method of derivatization of the polypeptide through its amino or 
carboxyl terminus. Although applicant contends that the instant invention is different 
from the prior art of Keough et al and Little et al, these differences do not appear in the 
claims. Therefore, it appears to the examiner that the combination of the references is 
proper". 

In response, Appellant reiterated that the references, alone or in combination with 
one another, neither disclose nor even suggest derivatizing polypeptides in aqueous 
solution, nor do they disclose or even suggest that the reagents for derivatization exhibit a 
half-life in aqueous solution of not less than 10 minutes at room temperature. This is 
clearly recited in claim 1 (upon which claims 7-9 are dependent) and claim 15 (upon 
which claim 19 is dependent). Thus, Appellant respectfully asserts the differences 
between the instant invention and the prior art of Keough, et al. and Little, et al. do 
appear in the claims. 
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In response to this argument, the Examiner states, "AppHcant argues that the 
reference of Keough and Little et al alone or in combination with one another neither 
discloses nor even suggest derivatizing polypeptides in aqueous solution, nor do they 
disclose or even suggest that the reagents for derivatization exhibit a half-life in aqueous 
solution of not less than 10 minutes at room temperature. This argument is noted but not 
found to be persuasive". 

In response, Appellant respectfully reasserts the arguments as to the 
inappropriateness of the rejection under the Keough, et al. reference and respectfully 
submits that the addition of the Little, et al. reference does nothing to remedy these 
deficiencies. Specifically, neither of these references alone or in combination with one 
another, discloses nor even suggests the claimed methodology inasmuch as the reagents 
are not taught. It is only with the teachings of the instant invention as presented in the 
captioned application that such is attainable. 

In view of the foregoing, Appellant respectfully submits that the Examiner's 
rejection cannot be sustained and should be reversed. 



Page 12 of 13 



Conclusion 

In view of the foregoing arguments. Appellant respectfully asserts that the 
Examiner's rejections cannot be sustained, and should be reversed. 

Respectfully submitted, 




Royal N. RonningTJr. 
Registration No.: 32,529 
Attorney for Appellant 



Amersham Biosciences Corp 
800 Centennial Avenue 
P. O. Box 1327 

Piscataway, New^ Jersey 08855-1327 

Tel: (732)457-8423 
Fax: (732) 457-8463 



I hereby certify that this correspondence is being deposited with 
the United States Postal Service as first class mail in an envelope 
addressed to: Mail Stop Appeal Brief - Patents, Commissioner 
for Patents, P.O. Box 1450, Alexandria, Virginia 22313-1450, on 
October 26. 2005 . 
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CLAIMS APPENDIX 



The Rejected Claims 

Claim 1 (original): A method of identifying a polypeptide, which method comprises the 
steps of 

(a) derivatizating, in an aqueous solution, the N-terminus of the polypeptide, or the 
N-termini of one or more peptides of the polypeptide, with at least one acidic 
reagent containing a sulfonyl moiety coupled to an ester moiety to provide one or 
more peptide derivatives, which reagent exhibits a half-life in aqueous solution of 
not less than 10 minutes at room temperature, to prepare one or more derivatives; 

(b) analyzing at least one said derivative using a mass spectrometric technique to 
provide a fragmentation pattern; and 

(c) interpreting the fragmentation pattern obtained to identify the polypeptide. 

Claim 2 (original): The method according to claim 1, wherein the acidic reagent has a 
pKa of less than about 2 when coupled to the polypeptide. 

Claim 3 (original): The method according to claim 1, wherein the mass spectrometric 
technique used in step (b) is matrix-assisted laser desorption ionization (MALDI) mass 
spectrometry. 

Claim 4 (original): The method according to claim 1, wherein the mass spectrometric 
technique used in step (b) is electrospray ionization (ESI). 



Claim 5 (original): The method according to claim 1, wherein in step (c), the 
fragmentation pattern is interpreted using a software program or database. 

Claim 6 (original): The method according to claim 1, wherein all the steps are conducted 
as part of an automated or semi-automated procedure. 

Claim 7 (previously presented): The method according to claim 1, wherein the acidic 
reagent includes an N-hydroxysuccinimide (NHS) ester. 

Claim 8 (original): The method according to claim 7, wherein the reagent comprises a 3- 
sulfopropionic acid N-hydroxysuccinimide ester. 

Claim 9 (original): The method according to claim 7, wherein the reagent comprises a 2- 
sulfobenzoic acid N-hydroxysuccinimide ester. 

Claim 10 (original): The method according to claim 1, wherein the polypeptide has been 
obtained by enzymatic digestion. 

Claim 1 1 (original): The method according to claim 10, wherein the enzyme is trypsin. 

Claim 12 (original): The method according to claim 1, which further comprises a step of 
protecting lysine residues prior to the derivatizating step. 



Claims 13-14 (cancelled) 

Claim 15 (original): A kit for identifying a polypeptide by a mass spectrometric 
technique, which kit comprises at least one acidic reagent comprising a sulfonyl moiety 
coupled to an activated acid moiety in a container, which reagent exhibits a half-life in 
aqueous solution of not less than 10 minutes, preferably not less than about 20 minutes 
and most preferably not less than about 30 minutes at RT. 

Claim 16 (original): The kit according to claim 15, wherein the reagent has a pKa of less 
than about 2 when coupled to the polypeptide. 

Claim 17 (original): The kit according to claim 15, wherein the mass spectrometric 
technique is matrix-assisted laser desorption ionization (MALDI) mass spectrometry. 

Claim 18 (original): The kit according to claim 15, wherein the mass spectrometric 
technique is electrospray ionization (ESI). 

Claim 19 (original): The kit according to claim 15, wherein the activated acid moiety is 
an N-hydroxysuccinimide (NHS) ester. 



Claim 20 (original): A kit according to claim 19, wherein the NHS ester is selected from 
the group consisting of 3-sulfopropionic acid N-hydroxysuccinimide ester and 2- 
sulfobenzoic acid N-hydroxysuccinimide ester. 




EVIDENCE APPENDIX 

Appellant Hereby appends a copy of U.S. Patent 6,322,970 to Little, et al. and a copy of 
international patent application number PCT/USOO/00790, which published as WO 
00/43792. This is the evidence relied upon by the Examiner for rejection of appealed 
claims. 
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INVENl OR(S) : Little et al. 

It is certified that error appears in the above- identified patent and that said Letters Patent Is 
hereby corrected as shown below: 



Title page . 

Please add to Item [*] Notice: - This patent is subject to a terminal disclaimer. - 
Item [56], References Cited, replace patent number "4,681,394" with -- 4,683,194 --. 
Add the following: 

- Sinha et al., Polymer support oligonucleotide synthesis XVIII: Use of beta- 
cyanoethyl-N,N-dialkylamino-/N-morpholino phosphoramidite of deoxynucieosides 
for the synthesis of DNA fragments simplifying deprolection and isolation of final 
products. Nucleic Acid Res. 12 :4539 (1984). -- 

Qilumn 77, 

Lines 24 and 25, claim 28 should read as follows: 

- 28. The process of claim 27, wherein the encoded polypeptide is isolated by 
reaction with an antibody. -- 
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PL 


Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 00/43792 



1 



PCT/USOO/00790 



METHODS AND KITS FOR SEQUENCING POLYPEPTIDES 



5 FIELD OF THE INVENTION 

The present invention relates to methods and kits which enable polypeptide sequencing 
using mass spectrometric techniques. The methods and kits are particularly useful for identifying 
high molecular weight polypeptides for use in, for example, biological, pharmaceutical, personal 
cleansing, and fabric cleaning fields. 

1 0 BACKGROUND OF THE INVENTION 

Recently, matrix-assisted laser desorption ionization (MALDI) mass spectrometry and 
electrospray ionization were developed for high-sensitivity peptide and polypeptide sequencing 
applications. See, e.g., Spengler et al„ "Peptide Sequencing by Matrix-assisted Laser-desorption 
Mass Spectrometry*', Rapid Communications in Mass Spectrometry, Vol. 6, pp. 105 - 108 (1992); 

15 Spengler et al, "Fundamental Asjjects of Postsource Decay in Matrix-Assisted Laser Desorption 
Mass Specn-ometry*\ Journal of Physical Chemistry, Vol. 96, pp. 9678 - 9684 (1992); Kaufmann 
et al, "Mass Spectrometric Sequencing of Linear Peptides by Product-ion Analysis in a 
Reflectron Time-of-flight Mass Spectrometer Using Matrix-assisted Laser Desorption 
Ionization", Rapid Communications in Mass Spectrometry, Vol. 7, pp. 902 - 910 (1993); 

20 Kaufmann et al., "Sequencing of Peptides in a Time-of-flight Mass Spectrometer: Evaluation of 
Postsource Decay Following Matrix-assisted Laser Desorption lonisation (MALDI), International 
Journal of Mass Spectrometry and Ion Processes, Vol. 131, pp. 355 - 385 (1994); Kaufmann et 
al., "Post-source Decay and Delayed Extraction in Matrix-assisted Laser Desorption/Ionization- 
Reflectron Time-of-Flight Mass Spectrometr\*\ Rapid Communications in Mass Spectrometry, 

25 Vol. 10, pp. 1199 - 1208 (1996); and Spengler, "Post-source Decay Analysis in Matrix-assisted 
Laser Desorption/Ionization Mass Spectrometr>' of Biomolecules", Journal of Mass 
Spectrometry, Vol. 32, pp. 1019 - 1036 (1997); Carr et al., "Integration of Mass Spectrometry in 
Analytical Biotechnology", Analytical Chemistry, Vol. 63, pp. 2802 - 2824, (1991); Yates III et 
al., "Mining Genomes With MS'\ Analytical Chemistry, Vol. 68, pp. 534A - 540A, (1996); 

30 Morris et al., "High Sensitivity Collisionally Activated Decomposition Tandem Mass 
Spectrometry on a Novel Quadrupole/Orthagonal Acceleration Time-of-Flight Mass 
Spectrometer", Rapid Communications in Mass Spectrometry, Vol. 10, 889 - 896, (1996). 
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MALDI offers several advantages in the field of mass spectronietr\\ For example, 
MALDI provides higher sensitivity than conventional electrospray triple quadrupole equipment. 
When used in combination with time-of-flight mass analyzers. MALDI is also applicable to 
higher mass peptides than can be analyzed with triple quadrupole equipment. MALDI is also 

5 useful for analyzing complex mixtures with minimal sample purification. Electrospray ionization, 
on the other hand, is readily interfaced to powerful separation techniques including liquid 
chromatography (LC) and various forms of capillary electrophoresis (CE). Highly automated 
analyses are possible when using LC and CE as the sample purification and introduction devices. 
However, current MALDI and, to a lesser extent, electrospray ionization mass 

10 spectrometric methods fail to adequately offer predictable tandem mass spectrometry 
fragmentation pattems. For example, multiple ion series (including a-ions, b-ions, and y-ions) are 
typically observed, resulting in MALDI post-source decay spectra that are too complex for 
efficient interpretation and sequencing. Multiple ion series (b- and y-ions), plus internal 
fragments and both singly and multiply charged ions are formed from multiply charged precursor 

15 ions generated by electrospray ionization, and the resulting tandem mass spectra are often 
difficult to interpret de novo. Accordingly, problems with fragmentation have limited the abilit>' 
to rapidly sequence polypeptides using mass spectrometry. As a result, mass spectrometry, and 
particularly MALDI mass spectrometry, is of limited value in this area. 

Several research groups have attempted to improve the utility of mass spectrometry in the 

20 field of polypeptide sequencing through the use of chemical derivatization techniques. Such 
techniques have been utilized to promote and direct fragmentation in the MSMS spectra of 
peptides with the goal of increasing sensitivity and decreasing the complexity of the resulting 
spectra. Most of these known techniques provide cationic derivatives. See, e.g., Kidwell et al., 
"Sequencing of Peptides by Secondary Ion Mass Spectrometry", Journal of the American 

25 Chemical Society, Vol. 106, pp. 2219 - 2220 (1984) (derivatization with a quaternary ammonium 
group, analysis using the static SIMS ionization method (prior to development of both MALDI 
and electrospray ionization)). Application of such techniques using MALDI and electrospray 
ionization with low-energy collisional activation have not proven generally effective. 

More recently, researchers have utilized other derivatization techniques in an effort to 

30 enhance peptide / polypeptide sequencing using mass spectrometric methods. For example, it has 
been shown that oxidation of cysteine residues present in tryptic peptides (peptides produced via 
digestion with trypsin) may improve fragmentation using electrospray tandem mass spectrometry. 
See, e.g., Gaskell et al., "Role of the Site of Protonation in the Low-Energy Decompositions of 
Gas-Phase Peptide Ions", Journal of the American Society of Mass Spectrometry, Vol. 7, pp. 522 

35 - 531 (1996) and Gaskell et al., "Influence of Cysteine to Cysteic Acid Oxidation on the 
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Collision-Activated Decomposition of Protonated Peptides: Evidence for Intraionic Interactions", 
Journal of the American Society of Mass Spectrometry, Vol. 3, pp. 337 - 344 (1992). 
Specifically, y-ion fragmentation was promoted. However, this technique has several limitations. 
For example, the technique was not extended to MALDI methods. The technique is also limited 
5 to the analysis of those polypeptides having cysteine residues occurring in the sequence to be 
analyzed. Indeed, cysteine occurs rather rarely in naturally occurring polypeptides, placing a 
severe limitation on the utility of this technique. 

Accordingly, there is a need to provide a mass spectrometric method of sequencing 
polypeptides that is simple, efficient, and widely applicable to wild-type and variant polypeptides. 

10 The present inventors herein provide a method for high-sensitivity polypeptide sequencing using 
mass spectrometric techniques. The present inventors have discovered that polypeptides and 
peptides thereof derivatized with relatively strong acid groups will provide y-ion fragmentation 
nearly exclusively, resulting in spectra which are easily interpreted de novo. The present 
invention is also related to kits which are used to conveniently enable performance of the present 

15 method. 

SUMMARY OF THE INVENTION 
The present invention relates to mass spectrometric methods and kits which are 
particularly useful for sequencing polypeptides. The methods involve determining the amino acid 
sequence of a polypeptide, the steps comprising: 
20 (a) derivatizing the N-terminus of the polypeptide or the N-termini of one or more 

peptides of the polypeptide with one or more acidic moieties having a pKas of less 
than about 2 when coupled with the polypeptide or peptides, to provide one or more 
derivatized analytes; 

(b) analyzing one or more derivatized analytes using a mass spectrometric technique to 
25 provide a fragmentation pattern; and 

(c) interpreting the fragmentation pattern. 

The kits of the present invention comprise one or more acidic moiety reagents which 
provide acidic moieties having pKas of less than about 2 when coupled with a polypeptide; and 
means for derivatizing the N-terminus of the polypeptide or the N-termini of one or more peptides 
30 of the polypeptide with one or more acidic moiety reagents. The present kits are particulariy 
useful in conjunction with performance of the methods. 

DETAILED DESCRIPTION OF THE INVENTION 
The methods and kits of the present invention are useful for sequencing polypeptides 
including, for example, wild-type, variant, and / or synthetic polypeptides. The methods and kits 
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are particularly useful for identifying high molecular weight polypeptides for use in, for example, 
biological, pharmaceutical, personal cleansing, and fabric cleaning fields. 

The present methods and kits have widespread utility. Applications include, but are not 
limited to: facilitation of biological studies requiring rapid determination of peptide or 

5 polypeptide sequences; identification of post-translational modifications in proteins and for the 
identification of amino acid modifications in variant proteins such as those used in, for example, 
commercial laundry and cleansing products; aiding the design of oligonucleotide probes for gene 
cloning; rapid characterization of products formed in directed evolution studies; combinatorial 
chemistry and peptide library identification; and proteomics. 

10 The present method involves addition of one or more relatively strong acid groups to the 

N-terminus of the polypeptide or one or more peptides formed through cleavage of the 
polypeptide prior to mass spectrometric analysis. Without intending to be limited by theory, it is 
believed that the resulting negatively charged derivative(s) facilitate low energy charge-site 
initiated fragmentation. This is especially effective wherein the C-termini of the polypeptide or 

15 peptides thereof are basic or hydrophobic, preferably basic residues. Again, without limitation by 
theory, it is believed that wherein a basic residue is protonated during mass spectrometric 
analysis, the relatively strong acid group will be deprotonated, which counter-balances the 
positive charge at the basic residue. An additional proton to ionize the derivative will be 
substantially "free*' to randomly ionize the backbone amide groups of the polypeptide / peptide 

20 because the most basic residue is already protonated. Additionally, without limitation by theory, 
it is believed that wherein a second relatively strong acid group is incorporated into the 
polypeptide / peptide that both acid groups will be deprotonated, providing two substantially 
"free" protons to randomly ionize the backbone amide groups of the polypeptide / peptide. 

Utilization of the present method provides significant increases in fragmentation 

25 efficiencies. Furthermore, increased fragment ion signal-to-noise ratios are observed for 
derivatized peptides relative to non-derivatized peptides having the same sequences. The 
resulting simple PSD MALDI and electrospray tandem mass spectra can be routinely interpreted 
de novo. 

Publications and patents are referred to throughout this disclosure. All references cited 
30 herein are hereby incorporated by reference. 

All percentages, ratios, and proportions used herein are by weight unless otherwise 
specified. 

As used herein, abbreviations will be used to describe amino acids. Such abbreviations 
are described in Table I below. Further described in Table I are average amino acid residue 
35 masses which are useful for the practice of the present invention. 
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Table I 



Amino Acid 


Three-letter 
Abbreviation 


One-Letter 
Abbreviation 


Average Amino Acid 
Residue Mass (Da) 


Alanine 


Ala 


A 


71.1 


Arginine 


Arg 


R 


156,2 


Asparagine 


Asn 


N 


114.1 


Aspartic Acid 


Asp 


D 


115.1 


Cysteine 


Cys 


C 


103.1 


Glutamine 


Gin 


Q 


128.1 


Glutamic Acid 


Glu 


E 


129.1 


Glycine 


Giy 


G 


57.1 


Histidine 


His 


H 


137.1 


Isoleucine 


lie 


I 


113.2 


Leucine 


Leu 


L 


113.2 


Lysine 


Lys 


K 


128.2 


Methionine 


Met 


M 


131.2 


Phenylalanine 


Phe 


F 


147.2 


Proline 


Pro 


P 


97.1 


Serine 


Ser 


S 


87.1 


Threonine 


Thr 


T 


101.1 


Tryptophan 


Tip 


W 


186.2 


Tyrosine 


Tyr 


Y 


163.2 


Valine 


Val 


V 


99.1 



Definitions 

As used herein, the term "desorption ionization*' refers to the transition of an analyte from 
the solid-phase to the gas-phase as ions. 

As used herein, the term "desorption" refers to the departure of analyte from the surface 
and / or the entry of the analyte into a gaseous phase. 

As used herein, "ionization" refers to the process of creating or retaining on an analyte an 
electrical charge equal to plus or minus one or more electron units. 

As used herein, the term "M ALDI" refers to matrix-assisted laser desorption ionization. 

As used herein, the term "matrix" in reference to "MALDI" refers to small, acidic, light 
absorbing chemicals which may be mixed in solution with the polypeptide of interest, or peptides 
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thereof, of interest in such a manner so that, upon drying on the sample stage, the crystalhne 
matrix-embedded analytes can be successfully desorbed and ionized from the solid-phase into the 
gaseous or vapor-phase following laser irradiation. Alternatively, solutions of polypeptides, or 
peptides thereof, may be loaded onto appropriate matrices which are pre-dried on the sample 
stage. Non-limiting examples of suitable matrices include nicotinic acid, sinapinic acid, ferulic 
acid, caffeic acid, a-cyano-4-hydroxycinnamic acid, and a-cyano-4-hydroxycinnamic acid mixed 
with nitrocellulose. 

As used herein, the term *'electrospray ionization" refers to the process of producing ions 
from solution by electrostatically spraying the solution from a capillary electrode at high voltage 
with respect to a grounded counter electrode. The definition is intended to include both 
electrospray ionization and pneumatically assisted electrospray ionization, which is also referred 
to as ionspray. As used herein, the term "electrospray ionization" applies to all liquid flow rates 
and is intended to include microspray and nanospray experiments. Moreover, the definition is 
intended to apply to the analyses of peptides directly infused into the ion source without 
separation, and to the analysis of peptides or peptide mixtures that are separated prior to 
electrospray ionization. Suitable on-line separation methods include, but are not limited to, 
HPLC, capillary HPLC and capillary electrophoresis, Electrospray ionization experiments can 
be carried out with a variety of mass analyzers, including but not limited to, triple quadrupoles, 
ion traps, orthogonal-acceleration time-of-flight analyzers and Fourier Transform Ion Cyclotron 
Resonance instruments. 

As used herein, the term "polypeptide" refers to a molecule having two or more amino 
acid residues. The method of the present invention is suitable for sequence identification of high 
mass polypeptides. 

As used herein, the term "wild-type" refers to a polypeptide produced by unmutated 
organisms. 

As used herein, the term "variant" refers to a polypeptide having an amino acid sequence 
which differs from that of the wild-type polypeptide. 

Methods of the Present Invention 

The present method is useful for determining the amino acid sequence of a polypeptide. 
By using the phrase "determining the amino acid sequence", the present inventors do not intend 
to be limited to determining the entire sequence of a given polypeptide. Rather, by this phrase it 
is meant herein that a portion, portions, and / or the entire sequence is determined. 

The present methods involve addition of one or more relatively strong acid groups to the 
N -terminus of a polypeptide or one or more peptides thereof to produce one or more dcrivatized 
analytes for mass spectrometric analysis. The polypeptide / peptides are then analyzed using a 
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mass spectrometric technique to provide a fragmentation panem. The resulting fragmentation 
pattern is interpreted, thereby allowing sequencing of the polypeptide. 

The present inventors have discovered that the acidity of the derivatizing group(s) (acidic 
moiety) has a profound effect on the resulting mass spectra. Surprisingly, those acidic moieties 

5 having a pKa of less than about 2 when coupled with a polypeptide or peptide thereof will yield 
fragmentation patterns which are easily interpreted to provide the desired sequence information. 
The ordinarily skilled artisan is competent to measure the pKa values as described herein using 
standard methods known in the an. Non-limiting examples of such methods include, for 
example, titration and electrochemical methods. The preferred method of measuring pKa values 

10 is through titration. 

The method of the present invention may be performed as follows: 
Derivaiization of Polypeptide and / or Peptides of the Polypeptide 

An important feature of the present invention is derivatization of a polypeptide or 
peptides of a polypeptide of interest with one or more a relatively strong acids, i.e., acidic 

15 moieties having pKas less than about 2, preferably less than about 0, and more preferably less 
than about -2, when coupled with a polypeptide or peptides of the polypeptide. By "peptides of a 
polypeptide" it is meant herein that the polypeptide is digested or otherwise cleaved (herein 
collectively referred to as "digestion" or the like) into two or more peptides. The resulting 
peptides are derivatized in accordance with the present method. By "when coupled" it is meant 

20 herein that the pKas of the acidic moieties is defined as measured after being covalently bonded 
with a polypeptide or peptide herein. 

The polypeptide or peptides thereof may be produced by any means. For example, if 
necessary, the polypeptide of interest is isolated for analysis. Several procedures may be utilized 
for isolation including, for example, one-dimensional and two-dimensional gel electrophoresis. 

25 As another example, polypeptides may be synthesized through combinatorial chemistry methods 
well known in the art. 

Digestion may occur through any number of methods, including in-gel or on a membrane, 
preferably in-gel. See, e.g., Shevchenko et al., "Mass Spectrometric Sequencing of Proteins from 
Silver-Stained Polyacrylamide Gels", Analytical Chemistry, Vol. 68, pp. 850 - 858 (1996). 

30 However, it is possible to digest the polypeptide either enzymatically or chemically, preferably 
enzymatically. It is most preferable to utilize a digestion procedure which yields a basic or 
hydrophobic residue, most preferably basic, at or near the C-terminus of the resulting peptides. 
By "at" it is meant herein that the basic or hydrophobic residue is the C-terminal residue of the 
peptide. By "near" it is meant herein that the basic or hydrophobic residue is preferably within 

35 about 40 amino acid residues from the C-terminus of the peptide, more preferably within about 30 
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residues, even more preferably about 20 residues, and most preferably within 10 amino acid 
residues from the C-terminus of the peptide. 

While many methods may be utilized for this procedure, it is preferred to enzymatically 
digest the polypeptide using, for example, trypsin, endoproteinase Lys C, endoproteinase Arg C, 

5 or chymotrypsin, preferably, trypsin, endoproteinase Lys C, or endoproteinase Arg C. and most 
preferably trypsin. Trypsin, endoproteinase Lys C, and endoproteinase Arg C are preferable 
because the resulting peptides of the polypeptide will typically terminate at the C-terminus with 
an arginine or lysine residue (basic residues), with the exception, of course, of the original C- 
terminus of the polypeptide. Other enzymes are also suitable, especially if basic residues occur at 

10 or near the C-terminus of the resulting peptides. Chymotrypsin is also preferred for digestion, 
which typically cleaves at hydrophobic amino acid residues. Chemical digestion is also useful. 
For example, digestion with cyanogen bromide is useful. 

The method of the present invention may be adapted according to that described in 
Patterson et al., U.S. Patent No. 5,821,063, assigned to PerSeptive Biosystems, Inc., issued 

15 October 13, 1998, particularly with respect to the digestion techniques described therein. For 
example, a plurality of samples having different ratios of agent to polypeptide may be utilized and 
denvatized according to the present invention. 

However, digestion is not always necessary, particularly when sequencing (but certainly 
not limited to) small polypeptides. As used herein, "small" polypeptides include those having 

20 preferably less than about fifty amino acid residues, more preferably less than about forty 
residues, even more preferably less than about thirty residues, still more preferably less than 
about twenty residues, and most preferably less than about ten amino acid residues. 

For example, polypeptides may be characterized which are synthesized by well-known 
means, including combinatorial chemistry methods (a "synthetic polypeptide"). In this instance, 

25 it is most preferable to synthesize a polypeptide having basic or hydrophobic residue, preferably 
basic (most preferably arginine or lysine), at or near the C-terminus of the resulting polypeptide. 

The polypeptide (if the polypeptide is sufficiently "small" as defined herein above) or the 
peptides of the polypeptide are derivatized with one or more acidic moieties having pKas of less 
than about 2, preferably less than about 0, and most preferably less than about -2 (when coupled 

30 with the polypeptide or peptides) to provide a derivatized analyte. The acidic moieties of the 
derivatized analyte are prepared by coupling with an acidic moiety reagent. The acidic moiety 
reagent is not limited, provided an acidic moiety on the polypeptide or peptides thereof results 
having the herein described pKa. Non-limiting examples of acidic moiety reagents which may be 
utilized for coupling include, for example, dithiobis(sulfosuccinimidylpropionate), S- 

35 acetylmercaptosuccinic anhydride, 2-iminothiolane (which may also be referred to as Traut's 
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reagent), dithiodiglycolic anhydride, tetrafluorosuccinic anhydride, hexafluoroglutaric anhydride, 
sulfosuccinic anhydride, 2-suIfobenzoic acid cydic anhydride, chlorosulfonylacetyl chloride, and 
1,3-propane sultone. Use of reagents such as S-acetylmercaptosuccinic anhydride, 2- 
iminothiolane, and dithiodiglycohc anhydride require oxidation after derivatization with the 

5 peptide to produce the acidic moiety. Acidic moiety reagents not requiring the oxidation step 
include, for example, tetrafluorosuccinic anhydride, hexafluoroglutaric anhydride, sulfosuccinic 
anhydride, 2-sulfobenzoic acid cyclic anhydride and chlorosulfonylacetyl chloride. These 
reagents are often preferred, due to more efficient synthesis and / or lack of complicating 
oxidation of labile residues in the polypeptide or peptides thereof. Coupling an acidic moiety 

10 reagent to the N-terminus of a cysteine-containing peptide, followed by oxidation of the cysteine 
sulfhydryl group to cysteic acid, is one means to produce peptides containing two acidic moieties 
(sulfonic acids). 

The acidic moieties are most preferably a sulfonic acid. Among these, the more preferred 
acidic moieties include 2-sulfoacetyl moiet>', 3-sulfopropionyl moiety, and 2-sulfobenzoyl 
15 moiety. 

The use of disuifonic acid derivatives is also preferred. Use of the disulfonic acid 
derivatives preferably results in both sulfonic acids groups near the N-terminus of the peptide. 
For example, coupling an acidic moiety reagent to the N-terminus of a cysteine-containing 
peptide, followed by oxidation of the cysteine sulfhydryl group to cysteic acid is one means to 

20 produce peptides containing two acidic moieties (sulfonic acids). 

The ordinarily skilled artisan will have the ability to perform the relatively simple 
coupling procedures required by the present invention. For convenience, however, non-limiting 
examples of derivatization of polypeptides, or peptides of the polypeptide of interest, are 
illustrated as follows: 

25 Example 1 



2-Sulfobenzoic acid cyclic anhydride (commercially available from Aldrich Chemical 
Co., Milwaukee, WI) is prepared at a concentration of 0.1 M in dry tetrahydrofuran prior to use. 
The polypeptide ASHLGLAR (I nmol, commercially available from Sigma Chemical Co., St. 
30 Louis MO) (SEQ ID NO: 1) is diluted into 20 ^iL of 0.05 M trimethylamine. The 2-sulfobenzoic 
acid cyclic anhydride solution (2 ^L) is added and the reaction mixture is vortexed for 30 




0 
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seconds. The reaction proceeds for approximately 2 minutes at room temperature prior to dilution 
of the resulting derivatized analyte and mass spectral analysis. The concentration of the acidic 
moiety reagent is decreased by a factor of as much as 100 when derivatizing smaller quantities. 



ASHLGLAR (1 nmol, commercially available from Sigma Chemical Co., St. Louis MO) 
(SEQ ID NO: 1) is mixed with 2 of 0.02 M sulfoacetic acid, which is formed by mixing 2 
of neat chlorosulfonylacetyl chloride (commercially available from Aldrich Chemical Co., 
Milwaukee, WI) with 500 p.L of water. The mixture is dried and then reconstituted in 20 |iL of 
tetrahydrofuranrdiisopropylethyl amine (4:1 v:v). 0.1 M chlorosulfonylacetyl chloride (2 |iL, 
commercially available from Aldrich Chemical Co., Milwaukee, WI) in dry tetrahydrofuran is 
added and the mixture is vortexed for 30 seconds. The derivatization reaction proceeds for 
approximately two minutes at ambient temperature. The derivatized analyte is dried, 
reconstituted in 20 water and further diluted prior to mass spectral analysis. 
Chlorosulfonylacetyl chloride is also a useful reagent for derivatization of 2D gel isolates, but a 
modified synthetic procedure provides more consistent product yields. The modified procedure is 
discussed in Example 14. 



S-acetylmercaptosuccinic anhydride (commercially available from Aldrich Chemical Co., 
Milwaukee, WI) is prepared at a concentration of O.l M in dry tetrahydrofuran prior to use. 
ASHLGLAR (1 nmol, commercially available from Sigma Chemical Co., St. Louis MO) (SEQ 
ID NO: 1) is diluted into 20 ^L of 0.05 M trimethylamine. The S-acetylmercaptosuccinic 
anhydride solution (5 |iL) is added and the reaction mixture is vortexed for 30 seconds. The 
reaction proceeds for about two minutes at room temperature and is then oxidized with 10 ^L of 
formic acid (88%):H202 (30%) prepared at a ratio of 19:1 (v:v). Oxidation proceeds for 16 
hours at room temperature, and the sample is dried prior to dilution and mass spectral analysis. 



Example 2 




Example 3 



O 
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The concentration of the acidic moiet>' reagent is decreased by a factor of as much as 100 when 
derivatizing smaller quantities. 
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Example 4 



HN 



O NH 



b 



HoN-ASHLGLAR 



O 



OH 



N-ASHLGLAR 



H 



To ASHLGLAR (1 nmol. commercially available from Sigma Chemical Co., St. Louis, 
MO) (SEQ ID NO: 1) in 0.1 M trimethylamine (20 |iL) is added 2-iminothiolane (Traut^s reagent, 
commercially available from Aldrich Chemical Co., Milwaukee, WI) (3 ^L, 0.1 M in deionized 
water). The reaction proceeds for 5 minutes at room temperature. The product is oxidized with 3 
fiL of formic acid (88%):H202 (30%) prepared at a ratio of 19:1 (v:v). Oxidation proceeds for 5 
minutes at room temperature, and the derivatized analyte is dried prior to mass spectral analysis. 



The polypeptide CDPGYIGSR (commercially available from Sigma Chemical Co., St. 
Louis, MO) (SEQ ID NO: 2) is oxidized by mixing I to 5 nM of polypeptide (in 5 - 20 ^iL water) 
with 10 ^L of formic acid (88%):H202 (30%) prepared at a ratio of 19:1 (v:v). Oxidation 
proceeds for 30 minutes at room temperature, and the derivatized analyte is dried prior to mass 
spectral analysis. 



Dithiobis(sulfosuccinimidylpropionate) (DTSSP) (commercially available from Pierce 
Chemical Co., Rockford, IL) is taken up in phosphate-buffered saline at 50 nM / 20 fiL and the 
pH IS adjusted to 7.7 with 1 N NaOH. The solution (20 ^L) is added to 1 ^L of peptide HLGLAR 



Example 5 



O 



O 




Example 6 
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(ai 1 nM / p.L) (SEQ ID NO: 3) and allowed to react for 30 minutes. The reaction is quenched 
with tris-hydroxymethyl-aminomethane (0. IM, 20 ^L). The sample is desalted and oxidized with 
10 laL of formic acid (88%):H202 (30%) prepared at a ratio of 19:1 (v:v). Oxidation proceeds 
for 30 minutes at room temperature, and the derivatized analyte is dried prior to mass spectral 



Dithiodiglycohc acid (0.93 g. 5.1 mmol, commercially available from Sigma Chemical 
Co., St. Louis, MO) (alternatively, a polymer thereof of anhydride thereof (cyclic or acyclic) may 

10 be used) is dissolved in dichloromethane (20 mL) and placed under inert atmosphere. 
Dicyclohexylcarbodiimide (1.05 g, 5.1 mmol) is added in one portion. After about 96 hours, the 
precipitate is removed by filtration and the filtrate is concentrated in vacuo. The resulting 
material is taken up in diethylether and filtered. Again, the filtrate is concentrated in vacuo to 
provide dithiodiglycohc anhydride. 

15 Dithiodiglycohc anhydride (the cyclic form is showTi in this example) is prepared at a 

concentration of 0.1 M in dry tetrahydrofuran prior to use. ASHLGLAR (1 nmol) (SEQ ID NO: 
1 ) is diluted into 20 of 0.05 M trimethylamine. The dithiodiglycohc anhydride solution (5 )iL) 
is added and the reaction mixture is vortexed for 30 seconds. The reaction proceeds for about two 
minutes at room temperature. The resulting product is oxidized with 2 \xL of formic acid 

20 (88%):H202 (30%) prepared at a ratio of 19:1 (v:v). Oxidation proceeds for 30 minutes at room 
temperature, and the derivatized analyte is dried prior to mass spectral analysis. The 
concentration of the acidic moiety reagent is decreased by a factor of as much as 100 when 
derivatizing smaller quantities. 



5 analysis. 



Example 7 
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Example 8 




O 



+ H2N-ASHLGLAR 




SO3H O 



N-ASHLGLAR 



3-Sulfopropionic anhydride is prepared at a concentration of 0.1 M in dry tetrahydrofuran 
prior to use. ASHLGLAR (I nmol) (SEQ ID NO: 1) is diluted into 20 ^iL of 
5 tetrahydrofuran:diisopropylethylamine 4:1 (v:v). The 3-sulfopropionic anhydride solution (2 ^L) 
is added and the reaction mixture is vortexed for 30 seconds. The reaction proceeds for about two 
minutes at room temperature prior to dilution and mass spectral analyses. The concentration of 
the acidic moiety reagent is decreased by a factor of as much as 100 when derivatizing smaller 
quantities. 

10 Example 9 



Tetrafluorosuccinic anhydride is prepared at a concentration of 0.1 M in dry 
tetrahydrofuran prior to use. ASHLGLAR (1 nmol) (SEQ ID NO: 1) is diluted into 20 ^L of 0.05 
M trimethylamine. The tetrafluorosuccinic anhydride solution (2 |aL) is added and the reaction 
15 mixture is vortexed for 30 seconds. The reaction proceeds approximately two minutes at room 
temperature prior to dilution and mass spectral analysis. The concentration of the coupling 
reagent is decreased by a factor of approximately 100 when derivatizing smaller quantities. 



The polypeptide CDPGYIGSR (commercially available from Sigma Chemical Company, 
St. Louis, MO)(SEQ ID NO: 2) is mixed with 2 ^L of 0.02M sulfoacetic acid, which is formed by 
mixing 2 |iL of neat chlorosulfonylacetyl chloride (commercially available from Aldrich 




O 



Example 10 
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Chemical Co., Milwaukee, WI) with 500 of water. The mixture is dried and reconstituted in 
20 |iL of tetrahydrofuran:diisopropylethyl amine (4:1 v/v). 0.1 M chlorosulfonylacetyl chloride 
(2 fiL) in dry tetrahydrofuran is added and the mixture is vortexed for 30 sec. The derivatization 
reaction proceeds for about 2 min. at ambient temperature. The derivatized analyte is dried and 
5 reconstituted in 10 |aL of water. To that solution is added 10 jiL of formic acid 
(88%):H2O2(30%) prepared at a ratio of 19:1 (v:v). Oxidation proceeds for 5 min. ai room 
temperature, producing the derivatized peptide having two sulfonic acid groups near the N- 
terminus. 

Analysis Using a Mass Spectrometric Technique 

10 Upon derivatization, the polypeptide or one or more peptides of the polypeptide are 

analyzed using a mass spectrometric technique. While the technique utilized is not limited, the 
preferred techniques are post-source decay (PSD) matrix-assisted laser desorption ionization 
(MALDI) and electrospray ionization tandem mass spectrometry. See, e.g., Spengler et al., 
"Peptide Sequencing by Matrix-assisted Laser-desorption Mass Spectrometry", Rapid 

15 Communications in Mass Spectrometry', Vol. 6, pp. 105 - 108 (1992); Spengler et al., 
"Fundamental Aspects of Postsource Decay in Matrix-Assisted Laser Desorption Mass 
Spectrometry", Journal of Physical Chemistry, Vol 96, pp. 9678 - 9684 (1992); Kaufmann et aL, 
"Mass Spectrometric Sequencmg of Linear Peptides by Product-ion Analysis in a Reflectron 
Time-of-flight Mass Spectrometer Using Matrix-assisted Laser Desorption Ionization", Rapid 

20 Communications in Mass Spectrometry, Vol. 7, pp. 902 - 910 (1993); Kaufmann et al., 
"Sequencing of Peptides in a Time-of-flight Mass Spectrometer: Evaluation of Postsource Decay 
Following Matrix-assisted Laser Desorption Ionization (MALDI), International Journal of Mass 
Spectrometry and Ion Processes, Vol. 131, pp. 355 - 385 (1994); Kaufmann et al., "Post-source 
Decay and Delayed Extraction in Matrix-assisted Laser Desorption/Ionization-Reflectron Time- 

25 of-Fiight Mass Spectrometry", Rapid Communications in Mass Spectrometry, Vol. 10, pp. 1 199 - 
1208 (1996); and Spengler, "Post-source Decay Analysis in Matrix-assisted Laser 
Desorption/Ionization Mass Spectrometry of Biomolecules", Journal of Mass Spectrometry, Vol. 
32, pp. 1019 - 1036 (1997); Carr et al., "Integration of Mass Spectrometry in Analytical 
Biotechnology", Analytical Chemistry, Vol. 63, pp. 2802 - 2824, (1991); Yates III et al., "Mining 

30 Genomes With MS", Analytical Chemistry, Vol 68, pp. 534A - 540 A, (1996); Morris et al, 
"High Sensitivity Collisionally Activated Decomposition Tandem Mass Spectrometry on a Novel 
Quadrupole/Orthagonal Acceleration Time-of-Flight Mass Spectrometer", Rapid 
Communications in Mass Spectrometry, Vol. 10, 889 - 896, (1996). Most preferably, the 
techniques utilized are positive ion mode PSD MALDI and electrospray ionization tandem mass 
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spectrometn'. For convenience, Examples 11 and 12 below illustrate mass spectrometric 
techniques which may be utilized to analyze the polypeptide or peptides thereof. 

As the present inventors have surprisingly discovered, appropriately derivatized peptides 
or peptides of the polypeptide provide MSMS spectra predominantly characterized by y-ions. As 

5 is well-known in the art, y-ions indicate ionized fragments containing the original C-terminus of 
the polypeptide or peptide. As used herein, the term "y-ion" also includes (y-NH3) ions; to 
illustrate, incomplete digestion products containing a second basic (for example) residue often 
yield abundant (y-NH3) ions. Preferably, the spectra produced by this method are substantially 
free of a-ions and b-ions. A-ions and b-ions are formed by cleavages on either side of backbone 

10 carbonyl groups. Importantly, charge is retained with the N-terminal fragment with a-ions and b- 
ions. As used herein, the term ''substantially free of with reference to a-ions and / or b-ions 
means that compared to the dominant y-ion series, a-ions and b-ions have a collective relative 
abundance of less than about 20%, preferably less than about 10%, and most preferably less than 
about 5%. 

1 5 According to the method of the present invention, it is not necessary to analyze and / or 

identify all digestion products to identify the polypeptide thereof, particularly if the sequence of 
the polypeptide resides in a sequence database. 

Example 1 1 - PSD MALDI Sequencing Technique 
in this example, the mass spectrometric technique is carried out on a Voyager DE-RP or 

20 Voyager DE-STR (PerSeptive Biosystems Inc., Framingham, MA) (or a suitable equivalent) 
equipped with a N2 laser (337 nm, 3 nsec pulse width, 20Hz repetition rate). The mass spectra 
are acquired in the reflectron mode with delayed extraction. External mass calibration is 
performed with a low-mass peptide standard, and mass measurement accuracy is typically ± 0.3 
Da. The derivatized polypeptide or peptides are diluted to about 10 pM/fiL in 0.1% 

25 trifluoroacetic acid (TFA). The samples are then diluted five-fold to ten-fold further in a-cyano- 
4-hydroxycinnamic acid (alphaCN) which is prepared by dissolving 10 mg in 1 mL of aqueous 
50% acetonitrile containing 0.1% TFA. See, e.g., Beavis et al., "a-Cyano-4-hydroxyciTmamic 
Acid as a Matrix for Matrix-assisted Laser Desorption Mass Spectrometry*', Organic Mass 
Spectrometry, Vol. 27, pp. 156 - 158 (1992). Gel isolates are analyzed from thin film surfaces of 

30 a-cyano-4-hydroxycinnamic acid / nitrocellulose (alphaCN/NC) prepared by the fast evaporation 
method. See, e.g., Amott et al., "An Integrated Approach to Proteome Analysis: Identification of 
Proteins Associated with Cardiac Hypertrophy", Analytical Biochemistry, Vol. 258, pp. I - 18 
(1998). 

PSD MALDI tandem mass spectra are acquired for the derivatized analyte after isolation 
35 of the appropriate precursor ion using timed ion selection. The derivatized analytes can be 
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analyzed using a number of MALDl matrices including, but not limited to alphaCN. alphaCN/NC 
and 2,5-dihydroxybenzoic acid (DHB). Fragment ions are refocused onto the final detector by 
stepping the voltage applied to the reflectron. Typical voltage ratios v^hich may be used are as 
follows: 1.0000 (precursor ion segment), 0.9126, 0.6049, 0.4125, 0.2738, 0.1975, and 0.1213 

5 (fragment segments). The individual segments are combined (or "stitched together" as is 
commonly used in the art) using software commercially available from PerSeptive Biosystems, 
Framingham, MA ("PSD" is selected from the analysis window). All precursor ion segments are 
acquired at low laser power (variable attenuator = 1450) for < 256 laser pulses to avoid saturating 
the detector. The laser power is increased (variable attenuator = 1650) for all of the remaining 

10 segments of the PSD MALDl acquisitions. Typically, 256 laser pulses are acquired for each 
fragment ion segment. The data are acquired at a digitization rate of 20 MHz. 

Example 12- An Electrospray Ionization Tandem Mass Spectrometry Sequencing Technique 
In this example, the mass spectra are acquired using a capillary LC system (Perkin Elmer 
Biosystems, Foster City, CA) coupled to a LCQ ion trap mass spectrometer (ThermoQuest, San 

15 Jose, CA) equipped with a home-built microelectrospray source (uES). A 0.5 x 150 mm C18 LC 
column (Perkin Elmer Biosystems, Foster city, CA) is used with a flow rate of 5 fil/min. The LC 
mobile phases are water and acetonitrile, each containing 0.02% TFA. A typical gradient is 15% 
acetonitrile for 5 min., then 15-60% acetonitrile over 40 min. Flow rates of 0.5 iiVmm to the uES 
source and 4.5 |xl/min to a UV detector are achieved by placing a splitting tee (1/16", 0.25 mm 

20 bore, Valco, Houston, TX) after the LC column. The derivatized peptide or polypeptide samples 
are injected onto the LC column with an aulosampler (ALCOTT, model 719, Norcross, GA). The 
uES source is housed on an X,Y,Z micrometer (New Focus, Inc., Santa Clara, CA) mounted to 
the front end of the instrument. The microelectrospray needle is a PicoTip (FS360-50-15-D) 
from New Objective (Cambridge, MA). 

25 The electrospray tandem mass spectra are acquired using the following instrumental 

conditions: spray needle voltage 1.5 kV, heated capillary temperature 200 "^C, and collision 
energy 35 eV. A mass range of 300-2000 m/z is used in each fiill-MS scan. The electrospray 
tandem mass spectra are acquired using data-dependent scanning in the "triple-play" mode which 
consists of three sequential microscans: 1) a full MS scan, 2) a zoom on selected ions to 

30 determine charge states, and 3) a MS/MS scans on appropriate ions selected from the zoom scans. 
These three scan events are repeated throughout the LC run. 
Interpreting the Fragmentation Pattern 

The fragmentation pattern produced by the mass spectrometric analysis is interpreted to 
sequence the polypeptide. An artisan ordinarily skilled in the field of mass spectrometry will be 
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able to manually interpret the fragmentation patterns of small polypeptides de novo without the 
aid of commercially available software or sequence databases. Similarly, the artisan will also be 
capable of sequencing the peptides of the polypeptides (the digest products) de novo. 
Ahematively, the artisan may use kno\\'n aids for interpretation mcluding. for example, 
5 commercially available software or sequence databases. 

For example, sequences of the polypeptide or peptides thereof are efficiently and 
accurately determined with the y-ion fragmentation pattern produced via this invention. 
Identification of individual amino acid residues can be accomplished de novo by measuring mass 
differences between adjacent members in the y-ion series. Identification is then accomplished by 

1 0 comparing the measured mass differences to the known amino acid residue masses (see Table I 
herein above). For example, a measured mass difference of 71.1 Da corresponds to alanine. The 
reading direction is also established directly from the mass spectrum. The direction is from the 
C-lerminus to the N-terminus if measuring from low-mass to high-mass. The reading direction is 
fi-om the N-terminus to the C-terminus if measuring fi-om high-mass to low-mass. 

15 The sequences of the polypeptide, and peptides thereof, may also be efficiently and 

accurately determined using software which accepts mass spectral Augmentation data, either 
uninterpreted y-ion series masses or sequence tags derived from the y-ion masses, as inputs for 
sequence database searches. Such search software commonly utilized by the skilled artisan 
include, but are not limited to, *'Protein Prospector" (commercially available from the University 

20 of California at San Francisco or http://prospector.ucsf.edu) and "Peptide Search'' (commercially 
available from the European Molecular Biology Laboratory at Heidelberg, Germany or 
http://www.mann.embl-heidelberg.de). 

The fragmentation pattern produced by this invention can be searched against a number 
of sequence databases including, bui not limited to, the NCBI non-redundant database 

25 (ncbi.nlm.nih.g0v/blast/db.nr.2), SWISPROT (ncbi.nlm.gov/repository/SWISS- 

PROT/sprot33.dat.z), EMBL (FTP://ftp.ebi.ac.uk/pub/databases/peptidesearch/), OWL 
(ncbi.nlm.nih.gov/repository/owl/FASTA.z). dbEST 
(ncbi.nlm.nih.gov/repository/dbEST/dbEST.weekly.fasta.mmddyy.z) and Genebank 

(ncbi.nlm.nih.gov/genebank/genpept.fsa.z). The entire sequence of the polypeptide of interest 

30 can often be retrieved from the sequence database by searching the fragmentation data produced 
from one or more of the relevant peptide derivatives formed using the methods of this invention. 

Of course, when using database searching techniques, it is most efficient to limit the 
searches by specifying that only y-ions or (y-NH3) ions are allowed fragments because y- and (y- 
NH3) ions are the most prominent species observed in the fragmentation patterns wherein the 

35 present methods are utilized. Other fragment ion types like a-, b-, (bH-H20), (b-H20), (b-NH3) 
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and internal cleavage ions can be disallowed because they are not prominent in the spectra of the 
peptides derivatized using the methods of the present invention. The derivatives formed with the 
present invention provide simple fragmentation patterns that often yield greater database search 
specificity than can be obtained from the spectra of the same peptides without derivatization. 
5 The methods of the present invention are further illustrated in the non-limiting examples 

below. In these examples, the ordinarily skilled artisan will recognize that numbers of "candidate 
proteins" produced may differ from those disclosed herein as new proteins are continually added 
to databases. 

Example 13 

10 PSD MALDI tandem mass speco'ometry of high-mass polypeptides is demonstrated with 

oxidized insulin B-chain FVNQHLC(S0,H)GSHLVEALYLVC(S03H)GERGFFYTPKA (SEQ 
ID NO: 4) (MMcalc = 3495.9 Da) (commercially available from Sigma Chemical Co., St. Louis, 
MO). The mass of this polypeptide far exceeds the upper limit of most conventional triple 
quadrupole instruments. The tandem mass spectrum of the native polypeptide shows a relatively 

15 intense ion series consisting substantially of y-ions. All y-ions between y9 and y24 are readily 
observed. The sequence-specific fragments defining the N-terminus of the molecule, y25 to y29, 
are absent from the spectrum. 

The spectrum of this polypeptide is improved by derivatization using the acidic moiety 
reagent sulfobenzoic acid cyclic anhydride (commercially available from Aldrich Chemical Co., 

20 St. Louis, MO). The derivatized polypeptide shows significant enhancement of the y-ions 
derived from the N-terminal region of the molecule (y25, y26, y27, y28, and y29). A complete 
series of y-ions is observed from y8 to y29 following derivatization. No prominent b-ions are 
detected. 

Example 14 

25 A polypeptide separated by two-dimensional gel electrophoresis is in-gel digested with 

trypsin to produce peptides of the polypeptide. The peptides show several intense MH+ signals 
by MALDI including ions at m/z 1060.8, 1090.8, 1271.0, 1299.0, 1312.0, 1344.0, 1399.1, 1450.1, 
1460.1, and 1794.4. Database searching using the Protein Prospector Software against the entries 
in the NCBI protein sequence library produces a list of forty-one candidate proteins that match 

30 five or more of the input tryptic masses (search parameters: MW range 1,000 to 150,000, 
isoelectric point from 3.0 to 10.0, oxidized Met allowed as a side-reaction and a conservative 
mass tolerance of +/- 0.6 Da), 

The peptides of the polypeptide can be derivatized with any one of several of the reagents 
discussed in the previous examples. They include, but are not limited to, 2-sulfobenzoic acid 

35 cyclic anhydride, 3-sulfopropinoic anhydride or chlorosulfonylacetyl chloride. In this example. 
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chlorosulfonylacetyl chloride is used as the acidic moiety reagent. The derivatization procedure 
for low-level peptides isolated from 2D gel electrophoresis is modified to improve reproducibility 
and derivative yields. Typically, the peptide extract from the 2D gel is concentrated to near 
dryness (5 to 10 ^L) on a speed vac. The concentrates are acidified with 15 jiL of 0.1% TFA and 

5 cleaned up using commercially available CIS mini-columns (ZipTip™, Millipore Corporation, 
Bedford, MA 01730). The cleaned up sample is dried on a speed vac and reconstituted in 10 |iL 
of base (THFiDIEA 19:1 v/v). In this example, 2 jiL of a chlorosulfonylacetyl chloride solution 
(2 |iL of the neat liquid in 1 mL THF) is added and the reaction proceeds for 1 to 2 min at room 
temperature. The derivatized samples are again dried on a speed vac and reconstituted in 10 

10 of 0.1% TFA prior to analysis. At this stage, the sample can be mixed with MALDI matrix and a 
portion loaded onto the sample stage for analysis, or it may be cleaned up again using a 
commercially available CIS mini-column (ZipTip^", Millipore Corporation). The cleaned up 
sample can then be eluted directly from the ZipTip onto the MALDI sample stage in a 1 - 2 
volume of acetonitrile:0.1% TFA (1:1 v/v) containing 10 mg/mL of MALDI matrix. This latter 

15 procedure allows loading all of the recovered derivatives onto the MALDI sample stage for 
analysis thereby improving overall sensitivity of the method. PSD MALDI tandem mass 
spectrometry is carried out on the derivatized peptide weighing about 1572 Da. A sequence tag is 
obtained having y-ions at m/z 574.5, 661,7, 875.9, 1003.S. 1 103,9, 1204.6, 1304.1, and the (MH+ 
- derivative) ion at m/z 1451.0. The spectrum is searched against the protein sequence database, 

20 and six candidate proteins (all mitochondrial aspartate aminotransferases) are returned (search 
parameters: MW range 1,000 to 150,000, parent ion mass tolerance +/- 0.6 Da, fragment ion 
mass tolerance +/- 2.5 Da, Par(mi)Frag(av), allowed number of missed ions = 1). This level of 
specificity is obtained because the database search is constrained to consider y-ions as the only 
allowed fragments. This search constraint is possible because N-terminal and internal cleavage 

25 ions are not prominent in the tandem mass spectra of these derivatives (due to derivatization 
according to the present invention). Much less specificity is obtained (239 candidate proteins 
returned) when this same tandem mass spectrum is searched against the full library and all 
fragment ion types (a, b, (b + H20), (b - NH3), (b - H20), internal and y) are allowed. The 
peptide was identified as FVTVQTISGTGALR (SEQ ID NO: 5). 

30 Confirmation of the protein identification is sought by PSD MALDI of the derivative 

weighing about 1916 Da. The spectrum contains 7 fragment ions at m/z 724.6, 1 154.2, 1299.3, 
1439.0, 1551.9, 1665.5, and 1779.2, in addition to the protonated precursor ion. This spectrum is 
searched against the database assuming the fragments to be y-ions. The search does not return 
mitochondrial aspartate aminotransferase or any other candidate protein. Researching the 
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database allowing both y and (y - NH3) ions returns 16 candidate proteins, 10 of which are 
mitochondrial aspartate aminotransferases. This latter search confirms the protein identification. 
Inclusion of the (y - NH3) ions as possible fragments is required for this particular search because 
the peptide is an mcomplete tryptic product (ILIRPLYSNPPLNGAR) (SEQ ID NO: 6) that 
5 contains a second arginine residue. As stated herein above, derivatized incomplete digestion 
products often exhibit (y - NH3) fragment ions in the PSD tandem mass spectra. 

Example 15 

An in-gel tryptic digest of a protein isolated from 2D gel electrophoresis is analyzed by 
LC electrospray tandem mass spectrometry on an LCQ ion trap following derivatization 

10 according to Example 14 herein. All spectra are acquired unattended in an automated data- 
dependent mode using the "triple play'* sequence of microscans. The tandem mass spectrum of a 
derivatized tryptic peptide (MH+ = 971.5) shows several y-type product ions including m/z 401.1, 
538.3, 651.3, 750.4, The measured y-ions are used as inputs, along with the MH+ mass of the 
underivatized tryptic peptide (971.5 - 122 = 849.5), for database searching. Database searching 

15 using the Protein Prospector Software against the NCBI protein sequence library returns 3 
candidate proteins (search parameters: MW range all, parent ion tolerance +/- 0.6 Da, fragment 
ion tolerance +/- 1 .0 Da» monoisotopic parent and fragment ions, and no missed ions allowed). 
All three of the candidate proteins returned from the database search are haptoglobins. The 
peptide was identified as VVLHPER (SEQ ID NO: 7). 

20 Confirmation of the protein identification is sought by searching the database using the 

tandem mass spectrum produced from a second derivatized peptide (MH+ of derivative = 771.4 
Da). The spectrum showed several ions including y-type ions at m/z 322.1, 436.2 and 535.3. 
Using the same search parameters as above, three peptide sequences match the input data from 
the electrospray tandem mass spectrum. One of the three peptide sequences corresponds to that 

25 of haptoglobin. The identification of that peptide was established as NVNFR (SEQ ID NO: 8). 
This result confirms the identity of the protein. 

Example 16 

The method of the present invention is readily utilized to identify variant polypeptides. 
The MALDI mass spectrum of a tryptic digest of an enzyme isolate shows many tryptic masses 

30 identical to those of the commercial protease Savinase® (commercially available from Novo 
Nordisk. Copenhagen, Denmark). One of the expected tryptic peptides 
GVLWAASGNSGAGSISYPAR (MH+ = 1933 Da) (SEQ ID NO: 9) is missing from the 
spectrum, and an unknown ion is observed at m/z 1963. This suggests that the isolate is a 
Savinase® variant. Eleven different single amino acid changes could account for the observed 

35 +30 Da mass shift (4 G S, 4 A T, and 3 V -> E). The PSD MALDI tandem mass spectrum 
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is obtained following derivatizaiion such as illustrated in Example 2 herein. A complete series of 
y-ions is observed for the 21 amino acid peptide. The spectrum verifies that the glycine at residue 
14 is converted to a serine. The masses of all the y-ions are measured in this spectrum. The 
spectrum is automatically mterpreted using the peptide ladder sequencing program, which is 
included in the PerSeptive Biosystems MALDI data system. 

Example 17 

The use of disulfonic acid derivatives, with both sulfonic acid groups near the N-terminus 
of the peptide, is demonstrated by comparison of the MALDI PSD spectra produced from 
CDPGYIGSR (SEQ ID NO: 2) derivatized according to Example 5 and according to Example 10 
herein. The derivative formed by performic acid oxidation of the sulfhydryl group of the cysteine 
residue (Example 5) is analyzed using PSD MALDI from a matrix of alpha-cyano-4- 
hydroxycinnamic acid. The resulting PSD spectrum is dominated by the y? fragment ion formed 
by cleavage of the labile Asp-Pro amide bond. Lower mass y-type ions show relatively low 
abundance. For example, the abundances of the y3 and y4 ions relative to the y? ion are only 
about 8% and 5% respectively (peak height ratios). Without intending to be limited by theory, it 
is believed that the "mobile" ionizing proton is preferentially localized on the basic Asp-Pro 
amide nitrogen atom because the Pro amide nitrogen is more basic than the other backbone amide 
groups. This charge localization is thought to yield preferential fragmentation of the amide bond 
between Asp and protonated Pro. This problem is minimized by addition of a second sulfonic 
acid group near the N-terminus of the peptide according to Example 10. Addition of the second 
sulfonic acid group requires addition of a second "mobile" proton to produce the positively 
charged ion used for MALDI PSD analysis. Again, without intending to be limited by theory, this 
second proton is believed to be free to ionize other backbone amide groups because the relatively 
basic Pro group is already ionized. Protonation of other backbone amide groups leads to 
enhanced fragmentation of those groups and increased relative abundance of the corresponding y- 
ions in the MALDI PSD spectrum. In the PSD spectrum of the derivative formed according to 
Example 10, the abundances of the y3 and y4 ions increase to about 41% and 57% respectively 
(peak height ratios) relative to that of the y7 ion. 

Kits of the Present Invention 

A further embodiment of the present invention are kits which may be utilized to 
determine the amino acid sequence of a polypeptide. The kits comprise: 

(a) one or more acidic moiety reagents providing acidic moieties having pKas less than 

about 2 when coupled with the polypeptide or one or more peptides of the polypeptide; 

and 
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(b) means for derivatizing the N-terminus of the polypeptide or the N-termini of one or more 
peptides of the polypeptide with one or more acidic moiety reagents. 
The kits of the present invention may be adapted to a mass spectrometer in a similar 
fashion as the sample holder described m. for example, Patterson, U.S. Patent No. 5,827,659, 
5 assigned to PerSeptive Biosystems, Inc., issued October 27, 1998. 

Optionally, the kits may further comprise one or more verification peptides to test, for 
example, the accuracy of the mass spectrometric technique. Reference mass spectral data may 
also be optionally be included. 

The acidic moiety reagents which may be included in the kits are described above. 
10 Especially preferred means for derivatizing include those which allow convenient 

derivatization by the analyst or any other person interested in obtaining the derivatized 
polypeptide or peptides. A particularly preferred means for derivatizing comprises one or more 
containment devices to contain, for example, the acidic moiety reagent and ultimately the 
polypeptide / peptides of interest. 
15 Suitable containment devices include, for example, vials, tubes, pipette tips, plates, 

sample holders, and multi-well plates. Optional convenience is provided wherein the acidic 
moiety reagent resides within the containment device so that it need not be added by the analyst. 
For example, the acidic moiety reagent may reside on the inside of a pipette tip and activated as 
the polypeptide or peptides are pulled into the tip with a suitable buffer. The containment device 
20 is most preferably disposable, but need not be. 

The acidic moiety reagent may also be bound to a solid support. For example, the reagent 
may be support-bound inside a pipette tip. The polypeptide or peptides of interest may be taken 
up in an appropriate buffer system and repeatedly drawn over the bound reagent. After reaction, 
an appropriate quantity of the derivatized polypeptide or peptides may be loaded directly onto the 
25 MALDI mass spectrometr\' sample stage, or injected into an electrospray ionization mass 
spectrometry device. 

One or more buffer systems used to facilitate derivatization may also be included in the 
kits of the present invention. The buffer system appropriate for inclusion is dependent upon the 
acidic moiety reagent included. Examples of preferred buffer systems are disclosed in the 
30 derivatization examples above. Particularly preferred buffer systems include, but are not limited 
to, tertiary amine solutions (both aqueous and non-aqueous (for example, solutions in 
tetrahydrofuran)) and neat tertiary amines. Particularly preferred tertiary amines include 
trimethylamine, triethylamine, and diisopropylethylamine. 

The kits of the present invention may also comprise one of more digestion aids such as 
35 those described herein above. Digestion aids may be chemical or enzymatic. For example. 
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trypsin, endoproteinase Lys C, endoproieinase Arg C. and / or chymotrypsin, preferably, trypsin, 
endoproteinase Lys C. and / or endoproteinase Arg C, and most preferably trypsin may be 
included as a digestion aid. Chemical digestion aids, such as cyanogen bromide, may also be 
included herein. 
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IS CLAIMED IS: 

A method of determining the amino acid sequence of a polypeptide characterized in that 
it comprises: 

(a) derivatizing the N-terminus of the polypeptide or the N-termini of one or more 
peptides of the polypeptide with one or more acidic moieties having pKas of less than 
2, preferably less than 0. more preferably less than -2 when coupled with the 
polypeptide or peptides, to provide one or more derivatized analytes; 

(b) analyzing one or more derivatized analytes using a mass spectrometric technique to 
provide a fragmentation pattern, preferably the fragmentation pattern is substantially 
free of a-ions and b-ions; and 

(c) mterpreting the fragmentation pattern. 

2. A method according to Claim 1 characterized in that the mass spectrometric technique is 
MALDI PSD mass spectrometr>'. preferably positive ion mode PSD MALDI; or 
electrospray ionization tandem mass spectrometry, preferably tandem electrospray 
ionization mass spectromen7. 

3. A method according to Claim 1 or 2 characterized in that interpretation of the 
fragmentation pattern comprises using a commercially available software program or 
database. 

4. A method according to any of Claims 1-3 characterized in that the polypeptide is a 
synthetic polypeptide. 

5. A method according to any of Claims 1-4 characterized in that the peptides of the 
polypeptide are produced by digestion, preferably the digestion is chemical digestion, 
more preferably the chemical digestion is cyanogen bromide digestion. 

6. A method according to Claim 5 characterized in that the digestion is enzymatic digestion, 
preferably the enzymatic digestion is selected from endoproteinase Lys C digestion, 
endoproteinase Arg C digestion, tryptic digestion, and chymotryptic digestion. 



WHAT 
1. 
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7. A method according to any of Claims 1-6 characterized in that the acidic moiely is one 
or more sulfonic acids or a disulfonic acid derivative, preferably the acidic moiety is a 2- 
sulfoacetyl moiety, a 3-sulfopropionoyl moiet>', or a 2-sulfoben2oyl moiety. 

8. A kit for use in determining the amino acid sequence of a polypeptide characterized in 
that it comprises: 

(a) one or more acidic moiety reagents providing one or more acidic moieties having 
pKas of less than 2 when coupled with the polypeptide or one or more peptides of the 
polypeptide; and 

(b) means for derivatizing the N-temninus of the polypeptide or the N-termini of one or 
more peptides of the polypeptide with one or more acidic moiety reagents. 

9- A kit according to Claim 8 characterized in that the means for derivatizing comprises one 
or more containment devices, preferably the acidic moiet\' reagent resides within the 
containment devices; preferably the means for derivatizing further comprises a buffer 
system. 



10. 



A kit according to Claim 8 or 9 characterized in that it further comprises one or more 
digestion aids. 
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SEQUENCE LISTING 



<110> 



Keough, Thomas W. 
Youngquist, Robert S. 



5 



<120> Methods and Kits for Sequencing Polypeptides 

<130> Methods/Kits for Sequencing Polypeptid 

10 <140> 7379P2 

<141> 1999-09-29 

<160> 9 

15 <170> Patentin Ver. 2.0 

<210> 1 

<211> 8 

<212> PRT 

20 <213> synthetic construct 

<400> 1 

Ala Ser His Leu Gly Leu Ala Arg 
1 5 



<210> 2 

<211> 9 

<212> PRT 

30 <213> synthetic construct 

<400> 2 

Cys Asp Pro Gly Tyr lie Gly Ser Arg 
1 5 



<210> 3 
<211> 6 
<212> PRT 
40 <213> synthetic construct 

<400> 3 

His Leu Gly Leu Ala Arg 



25 



35 



1 



5 



45 



50 



<210> 4 

<211> 30 

<212> PRT 

<213> bovine insulin 
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Phe Val Asn Gin His Leu Xaa Gly Ser His Leu Val Glu Ala Leu Tyr 
15 10 15 

Leu Val Xaa Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Ala 
20 25 30 



<210> 5 
<211> 14 
<212> PRT 

<213> pre adipocytes 
<400> 5 

Phe Val Thr Val Gin Thr lie Ser Gly" Thr Gly Ala Leu Arg 
15 10 



<210> 6 
<211> 16 
<212> PRT 

<213> pre adipocytes 
<400> 6 

lie Leu He Arg Pro Leu Tyr Ser Asn Pro Pro Leu Asn Gly Ala Arg 
15 10 15 



<210> 7 
<211> 7 
<212> PRT 
<213> rat lymph 

<400> 7 

Val Val Leu His Pro Glu Arg 
1 5 



<210> 8 
c211> 5 
<212> PRT 
<213> rat lymph 

<400> 8 

Asn Val Asn Phe Arg 
1 5 



<210> 9 
<211> 21 
<212> PRT 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Peptide 
<400> 9 

Gly Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He 
15 10 15 

Ser Tyr Pro Ala Arg 
20 
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